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"c/3 ■ Abstract 

o 

This report defines various operations and describes algorithms for weighted multi-tape automata 
(WMTAs). It presents, among others, a new approach to multi-tape intersection, meaning the inter- 
section of a number of tapes of one WMTA with the same number of tapes of another WMTA, which 
can be seen as a generalization of transducer intersection. In our approach, multi-tape intersection is 
not considered as an atomic operation but rather as a sequence of more elementary ones. We show an 
example of multi-tape intersection, actually transducer intersection, that can be compiled with our ap- 
proach but not with several other methods that we analyzed. Finally we describe an example of practical 
application, namely the preservation of intermediate results in transduction cascades. 
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1 Introduction 

Finite state automata (FSAs) and weighted finite state automata (WFSAs) are well known, mathemati- 
cally well defined, and offer many practical advantages. (Elgot and Mezei, 1965; Eilenberg, 1974; Kuich 
and Salomaa, 1986). They permit, among others, the fast processing of input strings and can be easily 
modified and combined by well defined operations. Both FSAs and WFSAs are widely used in language 
and speech processing (Kaplan and Kay, 1981; Koskenniemi, Tapanainen, and Voutilainen, 1992; Sproat, 
1992; Karttunen et al., 1997; Mohri, 1997; Roche and Schabes, 1997). A number of software systems 
have been designed to manipulate FSAs and WFSAs (Karttunen et al., 1997; van Noord, 1997; Mohri, 
Pereira, and Riley, 1998; Beesley and Karttunen, 2003). Most systems and applications deal, however, 
only with 1-tape and 2-tape automata, also called acceptors and transducers, respectively. 

Multi-tape automata (MTAs) (Elgot and Mezei, 1965; Kaplan and Kay, 1994) offer additional ad- 
vantages such as the possibility of storing different types of information, used in NLP, on different tapes 
or preserving intermediate results of transduction cascades on different tapes so that they can be re- 
accessed by any of the following transductions. MTAs have been implemented and used, for example, 
in the morphological analysis of Semitic languages, where the vowels, consonants, pattern, and surface 
form of words have been represented on different tapes of an MTA (Kay, 1987; Kiraz, 1997; Kiraz and 
Grimley-Evans, 1998). 

This report defines various operations for weighted multi-tape automata (WMTAs) and describes 
algorithms that have been implemented for those operations in the WFSC toolkit (Kempe et al., 2003). 
Some algorithms are new, others are known or similar to known algorithms. The latter will be recalled to 
make this report more complete and self-standing. We present a new approach to multi-tape intersection, 
meaning the intersection of a number of tapes of one WMTA with the same number of tapes of another 
WMTA. In our approach, multi-tape intersection is not considered as an atomic operation but rather as a 
sequence of more elementary ones, which facilitates its implementation. We show an example of multi- 
tape intersection, actually transducer intersection, that can be compiled with our approach but not with 
several other methods that we analyzed. To show the practical relevance of our work, we include an 
example of application: the preservation of intermediate results in transduction cascades. 

For the structure of this report see the table of contents. 

2 Some Previous Work 

2.1 n-Tape Automaton Seen as a Two-Tape Automaton 

Rabin and Scott (1959) presented in a survey paper a number of results and problems on finite 1-way 
automata, the last of which - the decidability of the equivalence of deterministic k-tape automata - has 
been solved only recently and by means of purely algebraic methods (Harju and Karhumaki, 1991). 

Rabin and Scott considered the case of two-tape automata claiming this is not a loss of generality. 
They adopted the convention "... that the machine will read for a while on one tape, then change control 
and read a while on the other tape, and so on until one of the tapes is exhausted . . .". In this view, a 
two-tape or n-tape machine is just an ordinary automaton with a partition of its states to determine which 
tape is to be read. 
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2.2 n-Tape Automaton Seen as a Single-Tape Automaton 

Ganchev, Mihov, and Schulz (2003) define the notion of "one-letter fc-tape automaton" and the main 
idea is to consider this restricted form of /c-tape automata where all transition labels have exactly one 
tape with a non-empty single letter. Then they prove that one can use "classical" algorithms for 1-tape 
automata on a one-letter /c-tape automaton. They propose an additional condition to be able to use 
classical intersection. It is based on the notion that a tape or coordinate is inessential iff \/(w\, ..., Wk) € 
R (R is a regular relation over (S*) fc ) and \/v € S*, (w\, ...t0j_i,u, Wi+i, ■■■,Wf ! ) € R. And thus to 
perform an intersection, they assume that there exists at most one common essential tape between the 
two operands. 

2.3 n-Tape Transducer 

Kaplan and Kay (1994) define a non-deterministic n-way finite-state transducer that is similar to a classic 
transducer except that the transition function maps Q x E e x ... x E € to 2^ (with E e = SU {e})- To 
perform the intersection between two ra-tape transducers, they introduced the notion of same-length 
relations . As a result, they treat a subclass of n-tape transducers to be intersected. 

Kiraz (1997) defines an n-tape finite state automaton and an n-tape finite-state transducer, introduc- 
ing the notion of domain tape and range tape to be able to define a unambiguous composition for ?7,-tape 
transducers. Operations on n-tape automata are based on (Kaplan and Kay, 1994) , the intersection in 
particular. 

3 Mathematical Objects 

In this section we recall the basic definitions of the algebraic structures monoid and semiring, and give a 
detailed definition of a weighted multi-tape automaton (WMTA) based on the definitions of a weighted 
automaton and a multi-tape automaton (Rabin and Scott, 1959; Elgot and Mezei, 1965; Eilenberg, 1974; 
Kuich and Salomaa, 1986). 

3.1 Semirings 

A monoid is a structure (M, o, 1) consisting of a set M, an associative binary operation o on M, and 
a neutral element 1 such that 1 o a = a o 1 = a for all a G M. A monoid is called commutative iff 
a o b = b o a for all a, b € M . 

A set K equipped with two binary operations, © (collection) and (8) (extension), and two neutral 
elements, and 1, is called a semiring, iff it satisfies the following properties: 

1. (K, ©, 0) is a commutative monoid 

2. (K, 0, 1) is a monoid 

3. extension is left- and right-distributive over collection: 

a<g>(6©c) = (a<g>6) ®(a<g>c), (a © 6) <g> c= (a© c) © (6© c) , Va,b,ceK 

4. is an annihilator for extension: 0®a = a©0 = 0, Va€l€ 
We denote a generic semiring as K = (K, ffi, ®, 0, 1). 



Kempe, Guingne, Nicart. Algorithms for n-Tape Automata. XRCE Report 2004/031 5 

Some automaton algorithms require semirings to have specific properties. Composition, for example, 
requires it to be commutative (Pereira and Riley, 1997; Mohri, Pereira, and Riley, 1998) and e-removal 
requires it to be k-closed (Mohri, 2002). These properties are defined as follows: 

1. commutativity: a®b = b® a , Va,6GK. 

fc+l k 

2. k-closedness: a n = a n , VaGK 

n=0 n=0 

The following well-known semirings are commutative: 

1. B = (1, V, A, 0, 1} : the boolean semiring, with B = {0, 1} 

2. TV = (N, +, x , 0, 1} : a positive integer semiring with arithmetic addition and multiplication 

3. TZ + = (M + , +, x , 0, 1) : a positive real semiring 

4. 1Z = (M , min, +, oo, 0} : a real tropical semiring, with R = M + U {oo} 

A number of algorithms require semirings to be equipped with an order or partial order denoted by 
<!C- Each idempotent semiring K (i.e., Va G K, : a © a = a) has a natural partial order defined by 
a <fc b 43- a © b = a. In the above examples, the boolean and the real tropical semiring are idempotent, 
and hence have a natural partial order. 



3.2 Weighted Multi-Tape Automata 

In analogy to a weighted automaton and a multi-tape automaton (MTA), we define a weighted multi-tape 
automaton (WMTA), also called weighted n-tape automaton, over a semiring K, as a six-tuple 



A (n) =def (X,Q,I,F,EW,K) 



(1) 



with 



s 




Q 




i 


CQ 


F 


QQ 


n 




#(») 


CQx (E*) n x I 


K. 


= (K, 0,8,6,1) 



For any state q G Q, 

X(q) G K 

q(q) g /C 



being a finite alphabet 
the finite set of states 
the set of initial states 
the set of final states 

the arity, i.e., the number of tapes of A^ 
x Q being the finite set of n-tape transitions and 
the semiring of weights. 

denotes its initial weight, with X(q) ^ 44> q G /, 
its final weight, with g(q) ^ <3> q G F, and 
its finite set of out-going transitions. 



For any transition e^ G E^ n \ with e( n ) = (p, &^> , w, n), 

p(e^) p : E^ — > Q denotes its source state 

£(e^ ) £ : fiW -» (S*) n its label, which is an ra-tuple of strings 

w(e^) w.E^K its weight, with w(e( n )) /0<^e( n ) G ^W, and 

n(e^) n : E — > Q its target state 
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A path 7r( n ) of length r = \tt^\ is a sequence of transitions e-j" - eJ, ' ■ ■ Zr sucn that n(e' ) = 
p( e i+i) f° r ai l * ^ [1) r ~ !]• ^ P atn i s sa id to be successful iff p(e^ n ) G I and n(ef. ) € -F 1 . In the 
following we consider only successful paths. The label of a successful path -ir^ equals the concatenation 
of the labels of its transitions 

<(ttW) = *(e£ n) ) £(4*°) ■■■ £(4 n) ) (2) 

and is an n-tuple of strings 

*(7r(»>) = S W = ( Sl , S2 ,..., Sn ) (3) 

If all strings Sj G X* (with j € [1, n]) of a tuple s^ n ' are equal, we use the short-hand notation s™ on 
the terminal string Sj. For example: 

( a fr c )( 3 ) = (abc, abc, abc) (4) 

e< 4 > = (e, e,e,e) (5) 

The n strings on any transition e^ are not "bound" to each other. For example, the string triple 
s( 3 ) = (aaa, bb, cccc) can be encoded, among others, by any of the following sequences of transitions: 
(a:b:cc)(a:b:c)(a:£:c) or (aa:£:£)(a:b:cc)(e:b:cc) or {aaa:bb:cccc){e:e:e), etc. 
The weight w(tt^) of a successful path is 



u;(ttW) A(p(eH) ® [ (&) w ( e ) ) ) ® <?( n(e™) ) (6) 

We denote by II(.A( n )) the (possibly infinite) set of successful paths of A^ and by Il(s^) the 
(possibly infinite) set of successful paths for the n-tuple of strings s^ n ' 

n(s^) = { Tr^eri^™)) I s ^=£(tt^) } (7) 

We call ~R{A^ n ') the n-ary or ra-tape relation of AS n > . It is the (possibly infinite) set of n-tuples of 
strings s^ having successful paths in A^: 

7e (n) = K{A^) = { s (n) | 3^ n )Gn(^ n ))A^(vr (?l) ) = s (n) } (8) 

The weight for any ?7,-tuple of strings s^ G TZ(A^) is the collection (semiring sum) of the weights of 
all paths labeled with s^ n ' : 

w(s^) = w(tt^) (9) 

7r( n )en(s( n )) 

By relation we mean simply a co-occurrence of strings in tuples. We do not assume any particular 
relation between those strings such as an input-output relation. All following operations and algorithms 
are independent from any particular relation. It is, however, possible to define an arbitrary weighted 
relation between the different tapes of TZ{A^ n '). For example, TZ(A^ 2 ') of a weighted transducer A^- 2 ' is 
usually considered as a weighted input-output relation between its two tapes, that are called input tape 
and output tape. 

In the following we will not distinguish between a language C and a 1-tape relation TIS 1 ), which 
allows us to define operations only on relations rather than on both languages and relations. 
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4 Operations 

This section defines operations on string n-tuples and n-tape relations, taking their weights into account. 
Whenever these operations are used on transitions, paths, or automata, they are actually applied to their 
labels or relations respectively. For example, the binary operation 6 on two automata, A\ 5 A^ 1 , ac- 
tually means 7Z(A[ n 6 A^ ) = lZ(Ai') oTZ^A^ 1 ). The unary operation 6 on one automaton, 6 A^ n \ 
actually means TZ{ 6 A^) = 6 TZ(A^). 

Ultimately, we are interested in multi-tape intersection and transduction. The other operations are 
introduced because they serve as basis for the two. 

4.1 Pairing and Concatenation 

We define the pairing of two string tuples, s^ n ' : v^ m ' = u^ n+m ' , and its weight as 

(si,... ,s n ) : (vi,...,v m ) = de f (si,... ,s n ,vi,... ,v m ) (10) 

w((si,...,s n ) : (vi,...,v m )) = dcf w({s 1 ,...,s n ))®w({vi,...,v m }) (11) 

Pairing is associative (concerning both the string tuples and their weights) : 

J ('ni). c ('«2). c (n3) _. ( a (m) . S M\ . S M = s (ni). ( gM :s ( n a)\ = s (ni+n 2 +n 3 ) q 2 ) 



We will not distinguish between 1-tuples of strings and strings, and hence, instead of gW.-t/ 1 ) or (s):(v), 
simply write s:v. 

The concatenation of two string tuples of equal arity, s^ n 'v^ n ' = vS n \ and its weight are defined as 

(si,...,s n )(vi,...,v n ) = dc f (siv 1 ,...,s n v n ) (13) 

w((s 1 ,...,s n )(v 1 ,...,v n )) = dc f w((si,...,8 n ))<g>w({vi,...,v n )) (14) 

Concatenation is associative (concerning both the string tuples and their weights) : 

S W S W S W = ( fl W fl W) 4 n > = ,[") (4 n) 4 n) ) = 8 W (15) 

Again, we will not distinguish between 1-tuples of strings and strings, and hence, instead of sW^W or 
(s)(v), simply write sv. 

The relation retween pairing and concatenation can be expressed through a matrix of string tuples 



„(«i) „("i) 

'11 ' ' ' b lr 



(n m ) (rim) 

'ml ' ' ' b mr 



(16) 



where the s- k are horizontally concatenated and vertically paired: 

(ni+...+n m ) /"«( ni ) ... «( ni A • • ^ Q ( n ™) . . „(n m ) 



' S lr J : ' ' ' : I S ml ' ' ' S mr 
i^" 1 ) • ... • _^ nm A . . . ( s [ n ^ ■ ... ■ S K™A (17) 



11 ■ ■ "ml / \°lr ■ ■ °mr 



(n) 

Note, this equation does not hold for the weights of the s- k , unless they are defined over a commutative 
semiring K. 
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4.2 Cross-Product 

The cross-product of two re-tape relations, IZ™ x IZ™ = W- n+m ' , is based on pairing and is defined as 

n^ x k { ™ ] = dC f { * (n) : v^ | s w e nf\v^ e 7^ m) } (is) 

The weight of each string tuple vS n+m ' g TZ^ 1 ' x 72^ m follows from the definition of pairing. 

The cross product is an associative operation. 

A well-know special case is the cross-product of two acceptors (1-tape automata) leading to a trans- 
ducer (2-tape automaton) : 

A® = A^xA^ (19) 

1Z{A^) = {s:v | seniA^veniA^)} (20) 

w A (s:v) = w Al (s) <g> wa 2 {v) (21) 

4.3 Projection and Complementary Projection 

The projection, Vj k,..X s )> °f a sa "ing tuple is defined as 

"Pj,k,..X (si,...,s n ) ) =dcf (sj,s k ,...) (22) 

It retains only those strings (i.e., tapes) of the tuple that are specified by the indices j,k,... € [1, n], and 
places them in the specified order. Projection indices can occur in any order and more that once. Thus 
the tapes of s^ can, e.g., be reversed or duplicated: 

Vn,...,i( (si,...,s n ) ) = (s n ,...,si) (23) 

V 0,3,5 ( ( s i » • • • ' S n) ) = ( s j, s j, s j) ( 24 ) 

The weight of the re-tuple s^ is not modified by the projection (if we consider s^ not as a member of 
a relation). 

The projection of an n-tape relation is the projection of all its string tuples: 

Pj,fc,...(ft (n) ) =def l> M | 3sWe^ n) AP iA ...(sW) = ^ m )} (25) 

The weight of each v ^ S Vj,k,..X^- ) * s ^ e collection (semiring sum) of the weights of each s^ n > £ 
TZ^ leading, when projected, to y( m ): 

^ (m) ) =dcf w(s^) (26) 

s( ?l ) I Pj,k,...(s( n ))=v( m ) 

The complementary projection, Vj t k,..X s )> °f a string re-tuple s^ n ' removes all those strings (i.e., 
tapes) of the tuple that are specified by the indices j,k, . . . G [1, re], and preserves all other strings in 
their original order. ' It is defined as 

Vj^.-X ( s l) • • • > s ") ) = dcf (• • • ) S j-1j s J+1) • • • > s fc-i> s fc+l> • • •) (27) 



1 Contrary to other authors, we do not call V ( ) an inverse projection because it is not the inverse of a projection in the sense: 
a = V(J3) and /3 = V' 1 (a). 
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Complementary projection indices can occur in any order, but only once. 

The complementary projection of an re-tape relation equals the complementary projection of all its 
string tuples: 

V jyK .{Tl^) = dcf {v^ | 3sW CUM A%...(s ( " ) ) = d W } (28) 

The weight of each v^ m > E Vjk,...C^ ) is the collection of the weights of each s^ € TZ^ leading, 
when complementary projected, to v^ m ' : 

w(v {m) ) =dcf w(s^) (29) 

4.4 Auto-Intersection 

We define the auto-intersection of a relation, Tj t k(TZ^), on the tapes j and k as the subset of JZ^ that 
contains all s^ n ' with equal Sj and Sk' 

ijA # (n) ) =dcf { * (n) enV) | Sj = Sk } oo) 

The weight of any s^ n ' G2j \k{TZ ) is not modified. 
For example (Figure 1) 

izf ] = (a,x,e) (b,y,a)* {e,z,b) = { (ab k ,xy k z,a k b) | fceN} (31) 

l h3 (TZf ] ) = {(ab L ,xy L z,a 1 b)} (32) 

Auto-intersection of regular n-tape relations is not necessarily regular. For example (Figure 3) 

izf = (a,e,x)* (a,a,y) {e,a,z}* = { {a k a,aa h ,x k yz h ) \ k,h£N } (33) 

l^ 2 (TZf) = {(a k a,aa k ,x k yz k ) | fceN} (34) 

The result is not regular because x k yz k is not regular. 

4.5 Multi-Tape and Single-Tape Intersection 

The multi-tape intersection of two multi-tape relations, 7c\ and 1Z 2 > uses r tapes in each relation, 
and intersects them pair- wise. The operation pairs each string tuple s^ n ' G TZ^ with each string tuple 

v ( m ) £ JZ2 iff Sj t = Vk t with ji € [1, n], fcj e [1, rre] for all i e [1, r]. Multi-tape intersection is defined 



as: 



izf ] n lZ { 2 m) = TZ in+m - r) (35) 



31, h 
Jr.fc 



=def {u^ +m -^ | IsWelZ^^vWelzW, Sji =v ki , *€[l,n],fee[l,m],Vt€[l,r] 

u (n+m- -) = ^ n+feiv jn+fer ( s (-) : „M) } 

All tapes fcj of 7^2 that have directly participated in the intersection are afterwards equal to the tapes 

(n) 

ji of 1Z\ , and are removed. All tapes ji are kept for possible reuse by subsequent operations. All other 
tapes of both relations are preserved without modification. 
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The weight of each n ( n+m " r ) 6 n { - n+m -^ is 

w( u (n+m-r) j = «;(>)) g, ^M) (36) 

This weight follows only from pairing (Eq. 11). It is not influenced by complementary projection (Eq. 29) 
because any two u( n+m ) = s( n ):^( m ) that differ in v kl also differ in Sj it and hence cannot become equal 
when the Vk t are removed. 

The multi-tape intersection of two relations, 7c\ and 1Z 2 , can be compiled by 

n (n) n n {m) = p K ( I jr>n+kr ( • • • l jun+kl ( K^ xK^) ■■■)) 07) 

jr, k r 

as can been seen from 

n^ x K { ™ ] = { s {n) :v^ I fl W G 7^ n) ,i/ m ) G 7^ m) } (38) 

J J1 , n+fcl (^S n) x 74 m) ) = { S ( "^ (m) | 3 S (") G 7^,3^ G ^ m) , Sil = v kl } (39) 

etc. 

Multi-tape intersection is a generalization of classical intersection of transducers which is known to 
be not necessarily regular (Rabin and Scott, 1959) : 

4 2) n 4 2) = A? n 4 2) = V 3A ( J 2 , 4 ( x li3 ( 4 2) x A 2 2) ) ) ) (40) 

2,2 

Consequently, multi-tape intersection has the same property. In our approach this results from the poten- 
tial non-regularity of auto-intersection (Eq. 37). 

We speak about single-tape intersection if only one tape is used in each relation (r = 1). A well- 
known special case is the intersection of two acceptors (1-tape automata) leading to an acceptor 



[Wn AO- aW rw n. ( T .{ , 

and yielding the relation 



A^nA^ = ^n^ = ^(Xi^AWx^JJ (41) 



7e(4 1)n 4 1} ) = ( s I sGTe(Ai), seK(A 2 ) } (42) 



io(s) = wai(s) ®wa 2 (s) (43) 

Another well-known special case is the composition of two transducers (2-tape automata) leading to 
a transducer. Here, we need, however, an additional complementary projection: 2 

4 2) o4 2) = ?M4 2) n4 2)) = ^(v4 2) x4 2) )) (44) 

It yields the relation: 

n(AfoAf) = { U ^\3s^GnA? ) ),3v {2) enA^),s 2 = v 1 ,n^=V 2t3 (s^:v^)} (45) 
w(u {2) ) = ^i(s (2) ) <8> wa 2 (w (2) ) (46) 

s( 2 ),t;( 2 ) | U1=S1,S2=U1,'U2=«2 

Multi-tape and single-tape intersection are neither associative nor commutative, except for special 
cases with r = n = m, such as the above intersection of acceptors and transducers. 

Composition of transducers Ti is expressed either by the o or the o operator. However, Ti o T2 equals T2 o T\ which 
corresponds to Tt( Ti( ) ) in functional notation (Birkhoff and Bartee, 1970). 
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4.6 Transduction 

A WMTA, A^ n \ can be used as a transducer having r input tapes, j\ to j r , and x output tapes, k\ to k x , 
which do not have to be consecutive or disjoint. 

To apply A^ to a weighted r-tuple of input strings, the tuple s^ r ' is converted into an input WMTA, 
l( r \ having one single path labeled with s^ and weighted with w(s^). An output WMTA, 0^ x \ 
whose relation contains all weighted x-tuples of output strings, !/", is then obtained through multitape- 
intersection and projection: 

O^ = V kl _ kx (A^ n iM) (47) 

31 > 1 



5 Example of Classical Transducer Intersection 

The following example of classical transducer intersection of A\ and A 2 is regular: 3 

* * 

a.b(ca.b\ ee ec e e/abec 

£ A { B e C ) ABCeA i R i A I B e C A 

2,2 

It has one theoretical solution which is 

a b / c a b \ e £ e c e abcabce e/abec 



£ A \B £ CJ ABCeA ABCABCA A^BeCA 

This solution cannot be compiled with any of the above mentioned previous approaches (Section 2). 
It cannot be enabled by any pre-transformation of the WMTAs that does not change their relations, 

(2) (2) 

7Z{A\ ) and 1Z{A 2 ). All above mentioned approaches do not exceed the following alternatives. 

5.1 First Failing Alternative 

One can start by typing all symbols (and e) with respect to the tapes, to make the alphabets of different 

tapes disjoint (which can be omitted for symbols occurring on one tape only) : 

* * 

a b / c a b \ e\ £\ e\ c £\ £\ ( a b e\ c 

e 2 A I B £ 2 CJ ABCe 2 A i i A I B e 2 C A 

2,2 

Then, one converts n tapes into 1 tape, such that each transition, labeled with n symbols, is transformed 
into a sequence of n transitions, labeled with 1 symbol each, which is equivalent to Ganchev's approach 
(Ganchev, Mihov, and Schulz, 2003) : 

a e 2 b A ( c B a e 2 b C) eiAeiB ejC c e 2 eiA n £iA ( a B b e 2 £iC c A) 

After these transformations, it is not possible to obtain the above theoretical solution by means of classi- 
cal intersection of 1-tape automata, even not after e-removal: 

abA(cBabC) ABCcA n A(aBbCcA) 

3 For sake of space and clarity we represent all regular expressions in this section in a special form where each tape appears 
on a different row and symbols of the same transition are vertically aligned. Note that it is not a matrix representation. More 
conventionally A^ could be written as (a,e)(b,A) ( (c, B){a, e)(6, C) )* (e,A)(e,B){e,C){c,e)(e,A). 
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5.2 Second Failing Alternative 

Alternatively, one could start with synchronizing the WMTAs. This is not possible across a whole 
WMTA, but only within "limited sections": in our example this means before, inside, and after the 
cycles: 



a b 

A e 



cab 
B C e 



c e e e 
A B C A 



n 
i, i 

2. 2 



£ 

A 



a b c 
B C A 



Then, one can proceed as before by first typing the symbols with respect to the tapes 



a b 

A e 2 



cab 

B C £ 2 



C £i £i £i 

A B C A 



n 
i, i 

2. 2 



£-1 
A 



a b c 
B C A 



and then transforming n tapes into 1 tape 

* 
aAb£2(cBaCb£2) cA £]B eiC £i A 



n 



£iA ( a B b C c A) 



The solution cannot be compiled with this alternative either, even not after £-removal: 



Ab (cBaCb) cABCA 



n 



a B b C c A) 



5.3 Solution with Our Approach 

To compile multi-tape intersection according to the above procedure (Eq. 37) 



A (2) 



A 



(2) 



n A 



(2) 



1. 1 

2. 2 



V 3A (l 2A (IiMA 2> xAf))) 



(48) 



>(D 



,(2) 



|(2)> 



we proceed in 3 steps. First, we compile B~\*' = li^(A\ x A 2 ) in one single step with an algo- 
rithm that follows the principle of transducer composition and simulates the behaviour of Mohri's £-filter 
(Section 6.3). 4 For the above example, we obtain 



£ a 

£ £ 

£ a 

A B 



b / £ c a b \ 

A £ B £ C 

b £ c a b 

£ V C A B £ / 



£ £ £ C £ 

A B C £ A 

£ £ £ C £ 

C £ £ A £ 



Next, we compile B 2 = 22,4,{B[ ) using our auto-intersection algorithm (Section 6.2) 



£ a 

£ £ 

£ a 

A B 



b / £ c 

A 

b 

£ 



a b \ 

B £ C 

£ c a b 

\ C A B £ J 



1 



£ £ £ C 

A B C £ 

£ £ £ C 



£ 

A 

£ 



C £ 



£ 



4 Composition with e-filter has been shown to work on arbitrary transducers (Mohri, Pereira, and Riley, 1998). 
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and finally, A^ = V^^{ B 2 ' ) with a simple algorithm for complementary projection: 
eab/ecab\ e e e c e 



e e A \ e Be C J ABCe A 
This final result equals the theoretical solution. 

6 Algorithms 

In this section we propose and recall algorithms for the above defined operations on WMTAs: cross- 
product, auto-intersection, single-tape and multi-tape intersection. By convention, our WMTAs have 
only one initial state i £ I, without loss of generality, since for any WMTA with multiple initial states 
there exists a WMTA with a single initial state accepting the same relation. 

We will use the following variables and definitions. The variables u[q], fi[q], etc. serve for assigning 
temporarily additional data to a state q. 

Original weighted automaton from which we will construct a new 
weighted automaton A 

New weighted automaton resulting from a construction 
State qi of an original automaton A\ assigned to a state q of a new 
automaton A 

pair of states (q\,q2) of two original automata, A\ and Ai, as- 
signed to a state q of a new automaton A 

triple of states belonging to the two original automata, A\ and Ai, 
and to a simulated filter automaton, A £ , respectively; assigned to a 
state q of a new automaton A 

Pair of "leftover" substrings (s, u) assigned to a state q of a new 
automaton A 

Delay between two string (or leftover substrings) s and u. For 
example: <5 (£[<?]) also written as 8{q) 

Pair of integers assigned to a state q, expressing the lengths of two 
strings s and u on different tapes of the same path ending at q 
Longest common prefix of the strings s and s' 
Short-hand notation for the projection of the label of e 



We describe two alternative algorithms to compile the cross product of two WMTAs, A™ and A^ ■ The 
second algorithm is almost identical to classical algorithms for crossproduct of automata. Nevertheless, 
we recall it to make this report more complete and self-standing. 

6.1.1 Conditions 

Both algorithms require the semirings of the two original automata, A\ and A^ , to be equal {JC\ = 
K-i). The second algorithm requires the common semiring K = K\ =K,2 to be commutative. 



Aj 




— {Lj,Qj,ij 1 l<j,hj,Lj 


A 




= (X,Q,i,F,E,K) 
= Ql 


/4?] 




= (<?1,<72) 


%] 




= (qi,Q2,q e ) 


m 




= (s,u) 


5(s,'i 


''.) 


= \s\ — \u\ 


x[q] 




= (XhX2) 


lcp(s 


,s') 
Xe) 


=-Pj,K.xm) 


.1 


Cross Product 
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6.1.2 Algorithms 

Cross product through path concatenation: The first algorithm pairs the label of each transition 

e\ G E\ with e( m ) (producing l{e\) : e^), and the label of each transition e 2 G E 2 with e^ (producing 
e (n) :£(e 2 )), and then concatenates A^ +m) with A^ +m) . We will refer to it as CROSSPC(Ai, A 2 ) where 
the suffix PC stands for path concatenation. 

CROSSPC(4 n) ,4 m) )^^ : 

1 A^(H l UX 2 ,Q 1 UQ2,h,F 2 ,E 1 UE2,1Ci) 

2 for Vei G Ei do 

3 £(ei)^£(ei):e( m ) 

4 for Ve 2 G ^2 do 

5 £{e 2 )^e^:£{e 2 ) 

6 for Vg g Fi do 

7 B^EUlfe^),^),^} 

8 g(q) <- 

9 return ^4 



We start with a WMTA A that is equipped with the union of the alphabets, the union of the state 
sets, and the union of the transition sets of A\ and A 2 . The initial state of A equals that A\, its set of 
final states equals that of A 2 , and its semiring equals those of A\ and A 2 (Line 1). First, we (post-) 
pair the labels of all transitions originally coming from A\ with e^ m \ and (pre-) pair the labels of all 
transition from A 2 with e^ n \ Then, we connect all final states of A\ with the initial state of A 2 through 
e( n + m ) -transitions, as is usually done in the concatenation of automata. 

The disadvantages of this algorithm are that the paths of A become longer than in the second algo- 
rithm below and that each transition of A is partially labeled with e, which may increase the running time 
of subsequently applied operations. 

To adapt this algorithm to non-weighted MTAs, one has to remove the weight from Line 7 and re- 
place Line 8 with: Final(q) <— false. 

Cross product through path alignment: The second algorithm pairs each string tuple of A™ with 
each string tuple of A 2 , following the definition (Eq. 18). The algorithm actually pairs each path -k\ 
of A^ with each path -k 2 of A™ transition-wise, and appends e-transitions to the shorter of two paired 
paths, so that both have equal length. We will refer to this algorithm as CROSSPA^i,^) where the 
suffix PA stands for path alignment. 

We start with a WMTA A whose alphabet is the union of the alphabets of A\ and A 2 , whose semiring 
equals those of A\ and A 2 , and that is otherwise empty (Line 1). First, we create the initial state i of 
A from the initial states of A\ and A 2 , and push i onto the stack (Lines 3, 20-26). While the stack is 
not empty, we take states q from it and access the states q\ and q 2 that are assigned to q through p,[q] 
(Lines 4, 5). 
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CROSSPA(4 n \4 m) )^: 

1 A^^U^, 0,^,0,0,^) 

2 Stack <- 

3 i <— getState(zi , i 2 ) 

4 while Stack / do 

5 q^pop(Stack) : //[g] = (gi,g 2 ) 

6 if giy^± A g 2 /L 

7 then for Vei € #(gi) do 

8 for Ve 2 G E{q 2 ) do 

9 g' <— GETSTATE(n(ei),n(e 2 )) 

10 E <- E U { (g, £(ei) :£(e 2 ), io(ei) <8> w(e 2 ), g') 

11 if e (gi)#0 V gi=± 

12 then for Ve 2 € £(g 2 ) do 

13 q' <- GETSTATE(_L,re(e 2 )) 

14 E^EU{(q, eW :£(e 2 ), £>(gi) (8 w(e 2 ), g') } 

15 if e(g 2 )#0 V g 2 = L 

16 then for Vei e E(q 1 ) do 

17 g' <- GETSTATE(n(ei),±) 

18 E^EU{(q, £( ei ):e( m \ w(e 1 )^g(q 2 ), q') } 

19 return ^4 

GETSTATE(gi , g 2 ) — ► g : 

20 if V eg :a*M = («!,©) 

[create new state] 
£fe) 



21 


then g <- g' 


22 


else Q^Qu{g} 


23 


£>(<?) <- e(?i) ^ 


24 


Mfe] <- (91,^2) 


25 


push(Stack, q) 


26 


return g 



If both gx and g 2 are defined (^ ±), we pair each outgoing transition c\ of q\ with each outgoing 
transition e 2 of g 2 (Lines 6-8), and create a transition in A (Line 10) whose label is the pair t(c\) : ^(e 2 ) 
and whose target q' coiTesponds to the tuple of targets (n(ei), n(e 2 )) (Line 9). If q' does not exist yet, it 
is created and pushed onto the stack (Lines 20-26). 

If we encounter a final state gi (with g(qi ) ^ 0) in A\, we follow the path beyond q\ on an e-transition 
that exists only "virtually" but not "physically" in A\ (Lines 11, 12). The target of the resulting transition 
in A corresponds to the tuple of targets (n(ei), n(e 2 )) with n(e\) being undefined (= _L) because e\ does 
not exist physically (Line 13). If we encounter a final state g 2 (with £>(g 2 ) 7^ 0) in A 2 , we proceed similarly 
(Lines 15-18). 

The final weight of an undefined state q = L is assumed to be 1 : g(-L) = 1 . 

To adapt this algorithm to non-weighted MTAs, one has to remove the weights from the Lines 10, 14, 
and 18, and replace Line 23 with: Final(q) <— Final (qi) A Final(q 2 ). 
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6.2 Auto-Intersection 

We propose an algorithm that attempts to constructs the auto-intersection A^ n ' of a WMTA A\ . Our ap- 
proach has some minor similarity with synchronization algorithms for transducers (Frougny and Sakaro- 
vitch, 1993; Mohri, 2003) : it uses the concept of delay between two tapes and assigns leftover-strings 
to states (see above). 

In the context of our approach, we understand by construction the compilation of reachable states 
q and transitions e^ of A^ n \ such that the absolute value of the delay 5(q), regarding tape j and k, 
does not exceed a limit <5 max 2 at any state q, i.e.: Vg : \6(q)\ < 5 max 2 A q reachable. The limit <5 max 2 * s 
imposed, i.e., any state whose delay would exceed it is not constructed. 

We distinguish two cases. In the first case, the delay of none of the reachable and coreachable 
states exceeds a limit <5 max (with 5 mSLX < <5 max2 ), i.e.: ^q : <5 max < \8(q)\ < <5 max2 A q reachable A 
q coreachable. We call it a construction with bounded delay or a successful construction because it 
is guarantied to generate the attempted result A^ n ' = Ij^A^ ). In this case the relation Ij^{A\ l ) 
has bounded delay, too, and is rational. 5 The limit 5 max is not imposed, i.e., any state q whose delay 
exceeds it would still be constructed (which places the construction into the second case if q becomes 
coreachable). 

In the second case, the delay of reachable and coreachable states is potentially unbounded. It exceeds 
c) max , and would actually exceed any limit if it was not (brute-force) delimited by 5 max 2, i.e.: 3q : 5 max < 
\5(q)\ < <5 m ax2 A q reachable A q coreachable. We call this a construction with potentially unbounded 
delay. It is not successful, and we cannot conclude on the correctness of the result A^ 1 ' and on the 
boundedness and rationality of the relation Tjk(A\ ). 

We will first describe the algorithm and then present some examples for further illustration. 

6.2.1 Algorithm 

Our algorithm starts with the compilation of the limits <5 max and <5 max 2 , then proceeds with the construc- 
tion of iW, and finally verifies the success of the construction, according to the above conditions. 

Compilation of limits: First, we traverse A\ recursively, without traversing any state more than 
once, and record three values: 5 max , being the maximal delay at any state, 5 m i n , the minimal delay at 
any state, and 5 cyc , the maximal absolute value of the delay of any cycle (Lines 3, 8-17). To do so, we 
assign to each state q\ of A™ a variable x[Qi] = (xi> X2) with the above defined meaning. The delay at 
a state q\ is 5(q\) = X\~Xi (Lines 8, 9). The delay of a cycle on q\ is the difference between <5'(gi) at 
the end and &(q\) at the beginning of the cycle (Line 1 1). 

Then, we compile 5 cyc , the maximal absolute value of delay required to match any two cycles. For 

example, let 1Z(A\ ) = ({(aa, e)} U {(e, aaa)})*, encoded by two cycles. To obtain a match between 

i-iiir) and ^2(71") of a path tt of A^- 2 ' ^Ti t 2{A\ ), we have to traverse the first cycle 3 times and the second 
two times, allowing for any permutation: A^ = ((aa, e) 3 (e, aaa) 2 U (aa, e) 2 (e, aaa) 2 (aa, e) 1 U . . .)*. 
This illustrates that in a match between any two cycles of A™ , the absolute value of the delay does not 
exceed 5 cyc = <5 cyc • max( 1, 5 cyc — 1 ) (Line 4). 
5 A rational relation is a weighted regular relation. 
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(8., 



maxi "max2) 



m, $cyc) <~ MAXDEL(zi, j, k, (0, 0), (0, 0, 0)) 



GETMAXDELAYSL4i , j, k) 
1 for Vgi G Qi do 
X[qi] ■ ~ J- 

\"maxi "', 

"eye * "eye ' TTlQXyL , cyc 
"max * VHdX[O cyc , "max 0< 
"maxl *• "max \ "eye 

return (5 max , S max2 ) 



1 



MAXDEL(gi ,j,k, (x'l ,X 2 ), ($'max , Cm , ^cj/c)) ~> ($ 

8 <5 max <- max(5' max , X1-X2) 
<5min <- min(5' min , X1-X2) 
if x[<7i] = (xi)X2) / -L 



9 
10 

11 
12 
13 
14 
15 

16 



then S cyc <- max(5' cyc , | 

e\sex[qi}^(x[,x 2 ) 

"eye i "eye 

forVe G -B(gi) do 



mm 5 "mm 



; Ocj/c/ 



[cycle end reached] 
(X1-X2) - (X1-X2) I) 



("maxj "' 



x[<?i 



max; "mini "eye) 



1 



MAXDEL(n(e),j,_A;,(x / k +Ki(e)|,X2 + l4(e)|), 

\"maxt "mini "eye)) 



1/ rsiurn [o maxi o m i n7 o cyc ) 



Next, we compile the first limit, S max , that will not be exceeded by a construction with bounded 
delay. In a match of two cycles this limit equals 5 cyc , and for any other match it is 5 max —6 m i n . In 
a construction with bounded delay, the absolute value of the delay in A^ n ' does therefore not exceed 



max(5 cyc , 5, 



eye 1 "max 



(Line 5). 



Finally, we compile a second limit, 5 ma , x2 , that allows us, in case of potentially unbounded delay, to 
construct a larger A^> than <5 max does. Unboundedness can only result from matching cycles in A^ . To 
obtain a larger A^ n \ with states whose delay exceeds 5 max , we have to unroll the cycles of A± further 



until we reach (at least) one more match between two cycles. Therefore, 5 maX 2 



+S cyc (Line 6). 



Construction: We start with a WMTA A whose alphabet and semiring equal those of A\ and that 
is otherwise empty (Line 2). To each state q that will be created in A, we will assign two variables: 
u[q] = q± indicating the corresponding state q\ in A\, and £[q] = (s, u) stating the leftover string s of tape 
j (yet unmatched in tape k) and the leftover string u of tape k (yet unmatched in tape j). 

Then, we create an initial state i in A and push it onto the stack (Lines 4, 18-27). As long as the 
stack is not empty, we take states q from it and follow each of the outgoing transitions e\ G E(q\) of the 
corresponding state q\ = u[q] in A\ (Lines 5-7). A transition e\ in A\ is represented as e G E{q) in A, 
with the same label and weight. To compile the leftover strings £[q'] = (s', u') of its target q' = n(e) in A, 
we concatenate the leftover strings ^[q] = (s, u) of its source q = p(e) with the j-th and fc-th component 
of its label, £j(e\) and ^fc(ei), and remove the longest common prefix of the resulting strings s ■ £j{e\) 
and u ■ 4(ei) (Lines 8, 14-17). 
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AutoIntersect(^4i , j, k) —>(A, boolean) : 

1 (<Smax, ^max2) «- GETMAXDELAYS(^i , j, k) 

2 A«-(Ei,0,_L,0,0,£i) 

3 Stack <- 

4 i <— GETSTATE(ii , (e, e)) 

5 while Stack / do 

6 g <- pop(Stack) : u[q]=qi, £[g] = (s,u) 

7 forVei G £(gi) do 

8 (s')W') <— CREATELEFT0VERSTRINGS(s,£j(ei),M,4(ei)) 

9 if ( s , = eV«' = e) A (\S(s',u')\ < S max2 ) 

10 then g' <- GETSTATE(n(ei), (s',u')) 

11 B^fiuKg,^),^),^)} 

12 successful <— ( jz3g€Q : I <£(£[?]) I > ^max A coreachable(q) ) 

13 return ( A , successful ) 

createLeftoverStrings(s , si, u , ui) — » (s, u) : 

14 s <— so •si 

15 u <— uo u\ 

16 x <— lcp(s,u) 

17 return (x" 1 s,x _1 u) 

GETSTATE(gi , (s', u')) — >• g : 

18 if 3q> £ Q : v[q'] = Ql A £[g'] = (s>') 

19 then g <- g' 

20 else Q^QU{g} [create new state] 

21 if s = e A 11 = £ 

22 then g(q) <- ^(gi) 

23 else ^(g) <- 

24 i/[g] «- gi 

25 £[?] <-(*'.«') 

26 push(Stack, q) 

27 return g 



If both leftover strings s' and v! of g' are non-empty (^ e) then they are incompatible and the path 
that we are following is invalid. If either s' or u' is empty (= e) then the current path is valid (at least up 
to this point) (Line 9). Only in this case and only if the delay between s' and v! does not exceed <5 ma x2, 
we construct a transition e in A corresponding to e\ in A\ (Line 9, 11). If its target q' = n(e) does not 
exist yet, it is created and pushed onto the stack (Lines 10, 18-27). The infinite unrolling of cycles is 
prevented by 5 max 2- 

Verification: To see whether the construction was successful and whether A^ n > =Ij 1 k C^i )> we 
have to check for the above defined conditions. Since all states of A^ are reachable, it is sufficient to 
verify their delay and coreachability (Line 12) : jz3g : \6(q)\ > <5 max A g coreachable. 
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6.2.2 Examples 

We illustrate the algorithm through the following three examples that stand each for a different class of 
WMTAs. 

(3) 

Example 1: The relation of the WMTA, A\ , of the first example is the infinite set of string tuples 
{(ab k ,xy k z, a k b)\k G N} (Figure 1). Only one of those tuples, namely (ab, xyz, ab), is in the relation 
of the auto-intersection, A^ = Xi^(A\ ), because all other tuples contain different strings on tape 1 
and 3. In the construction, an infinite unrolling of the cycle is prevented by the incompatibility of the 
leftover substrings in £[3] and £[4] respectively. The construction is successful. 

The example is characterized by: 



"max — 0max2 

K{Af) 
?q€Q: \6($[q])\ > <5 max 



{(ab k ,xy k z,a k b) | k G N} 

{(ab ,xy z,a b)} 

successful => rational X\^( ) 



(49) 
(50) 
(51) 
(52) 



a (3) 
Ai 



a:x:£/w 
b:y:a/wj 




(3) 



vf 

u v=l v=l 

^=(b,e) i £=(bb,a) 

b:y:a/w/ AAb:y:a/w, >-~\ 
v=l (J ) y - a " 1 .( 2}- ? --- 1 ~ 3 ' 
^=(a,e) 



8:z:b/w- 



v=2 .' 4 



v=2 

£=(££) 



e:z:b/w- 



5J/p 2 



Figure 1: A WMTA A\ and its successfully constructed auto-intersection A^ 3 ' = X\^{A\ 
(Dashed parts are not constructed.) 



(3), 



(3) 

Example 2: In the second example (Figure 2), the relation of A\ is the infinite set of string tuples 
{(a k ,a,x k y) \ k G N}. Only one of those tuples, namely (a l ,a,x l y), is in the relation of the auto- 
intersection A^ 3 ' = Xi t 2(A\ ). In the construction, an infinite unrolling of the cycle is prevented by 
the limit of delay <5 max 2- Although the result contains states with #(£[<?] )| > S max , none of them is 
coreachable (and would disappear if the result was pruned). The construction is successful. 

The example is characterized by: 



^qeQ: \6(£[q])\ > <5 max A cm 



"max 


= 2 




(53) 


<5max2 


= 3 




(54) 


n{Af ] ) 


= {{a k ,a,x k y) | /fc G N} 




(55) 


K(A®) 


= {(a 1 , a, x^-y}} 




(56) 


hable(q) 


=>■ successful =>■ rational X\\ 


i() 


(57) 
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a:e:x/w 



A (3) 

Aj z:a:y/w, 



IJ/Pi 



(3) 



v =1 5 
£=fe.af 



v=0 v=0 v=0 v=0 v=0 

^=(£,8) ^=(a,e) i;=(aa,£) i;=(aaa,£) ^=(aaaa,e) 

, a:e:x/W /~\ a:£:x/w 0/ ^\ a:£:x/w 0/ ^\ a:e:x/w /"'\ 

j •( 1 ) • ( 2 ) • ( 3 } ~ 4 i 



e:a:y/wj 



e:a:y/wj 




e:a:y/wj 



v=1 (( 6))/p, „ v=1 ( 7 



8:a:y/Wj 



v=l 
£,=(aa,£) 



Figure 2: A WMTA A^ and its successfully constructed auto-intersection A^ = T\^{A\ ) 
(Dashed parts are not constructed. States q marked with ■/ have |<5(£[(/])| > S max .) 



(3) 

Example 3: In the third example (Figure 3), the relation of A t is the infinite set of string tuples 
{(a k a, aa h , x k yz h ) \ k, h G N}. The auto-intersection, Zi^,{A\ ), is not rational and has unbounded 

(3) 

delay. Its complete construction would require an infinite unrolling of the cycles of A\ and an infinite 
number of states in A^ which is prevented by <5 m ax2- The construction is not successful because the 
result contains coreachable states with <5(£[<?])| > S max . 
The example is characterized by: 



2max2 



ft(4 3) ) 
x li2 (ft(4 3) )) 

l h2 {K{Af ] )) D K(A®) 
3q£Q : \8(£[q])\ > £ max A coreachable (q) 



2 (58) 

3 (59) 
{{a k a, aa h , x k yz h ) | k, h G N} (60) 
{(a k a,aa k ,x k yz k ) | A; € N} (61) 
{(a k a,aa k ,x k yz k ) \ k G [0,3]} (62) 
not successful (63) 



A (3) 

Ai 



a:e:x/w 



a:a:y/wj 




e:a:z/w 2 



v=0 
|=(e,e) 



v=0 
^=(a,£) 



v=0 
£=(aa,e) 



v=0 
^=(aaa,£) 



v=0 
^=(aaaa,£) 



(3) 



v=l 



a:£:x/w /-"n a:£:x/w 0/ ^\ a:£:x/w 0/ ^^ a:8:xAv 0/ " -, 

* 4 ; 



a:a:y/wj 



a:a:yAvj 



£=(£,£) V7p 7 

8:a:z/w 2 




a:a:y/w 7 a:a:y/Wj 



8:a:z/w 2 / x x z:a:z/w 2 r\ 8:a:z/w 2 

v=l 115) )' 2 ( 6 > 2 { l" 



v=l 

^=(a,£) 



v=l 

^=(aa,£) 




v=l 
^=(aaa,£) 



8:a:z/w 2 r~~\ 8:a:z/w 2 s~~\ e:a:z/w 2 -'">, 

— ^^K 10) — • (11) — : 12; 



v=l 

£=(£,aa) 



v=l 
£=(£,aaa) 



v=l 

£=(£.aaaa1 



Figure 3: A WMTA A\ and its partially constructed auto-intersection A^> C T\^{A\ ) 
(Dashed parts are not constructed. States q marked with V have |#(£[<?])| > 5 max .) 
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6.3 Single-Tape Intersection 

We propose an algorithm that performs single-tape intersection of two WMTAs, A\ and A 2 , in 

one step. Instead of first building the cross-product, A™ x A^ > an d then deleting most of its paths 
by auto-intersection, 2 jn+ / c ( ), according to the above procedure (Eq. 37), the algorithm constructs 
only the useful part of the cross-product. It is very similar to classical composition of two trans- 
ducers, and incorporates the idea of using an e-filter in the composition of transducers containing e- 
transitions (Mohri, Pereira, and Riley, 1998, Figure 10) that will be explained below. Instead of ex- 
plicitly using an e-filter, we simulate its behaviour in the algorithm. We will refer to the algorithm as 
IntersectCrossEpsC4i , A 2 ,j, k): 

lNTERSECTCROSSEPS(A!,A 2 ,j,fc) = l j;n+k ( A { ^ X A^ ) (64) 

A^ n A { 2 m) = V n+k ( IntersectCrossG4i , A 2 ,j, k) ) (65) 

The complementary projection, V n +k{ )> could be easily integrated into the algorithm in order to avoid 
an additional pass. We keep it apart because IntersectCrossEps( ) serves also as a building block of 
another algorithm where this projection must be postponed. 

6.3.1 Mohri's e-Filter 

(2) (2) 

To compose two transducers, A\ and A 2 , containing e-transitions, Mohri, Pereira, and Riley (1998, 
Figure 10) are using an e-fiiter transducer. In their approach, A^ ' and A 2 ' are pre-processed (Figure 4) : 

(2) (2) 

each e on tape 2 of A\ is replaced by an e\ and each e on tape 1 of A\ by an e 2 - In addition, a looping 
transition labeled with e : (j)\ is added to each state of A\ , and a loop labeled with 4> 2 : e to each state of 

(2) (2) 

A 2 . The pre-processed transducers are then composed with the filter A e in between: A\ o A £ o A 2 . 




e:<h (t» 2 :e 



At x: £i A 2 £2^ 

Figure 4: Mohri's e-filter A £ and two transducers, A\ and A 2 , pre-processed for filtered compo- 
sition, x = -<{4>x,<f>2,£i,£2}- (For didactic reasons we are using slightly different labels than 
Mohri et al). 



The filter controls how e-transitions are composed along each pair of paths in A\ and A 2 respectively. 
As long as there are equal symbols (s or not) on the two paths, they are composed with each other and 



Kempe, Guingne, Nicart. Algorithms for n-Tape Automata. XRCE Report 2004/031 22 

we stay in state of A £ . If we encounter a sequence of e in A\ but not in Ai, we move forward in A\, 
stay in the same state in A2, and in state 1 of A e . If we encounter a sequence of e in A2 but not in A\, 
we move forward in A2, stay in the same state in A\, and in state 2 of A £ . 

6.3.2 Conditions 

Our algorithm requires the semirings of the two WMTAs to be equal (/Ci = /C 2 ) and commutative. All 
transitions must be labeled with ra-tuples of strings not exceeding length 1 on the intersected tapes j of 
A\ and k of A2 which means no loss of generality: Vei G E\ : \£j (ei)| < 1 ; Ve 2 G E% : |4(e 2 )| < 1 

6.3.3 Algorithm 

We start with a WMTA A whose alphabet is the union of the alphabets of A\ and A2, whose semiring 
equals those of A\ and A2, and that is otherwise empty (Line 1). 

lNTERSECTCROSSEPSL4j n) , A 2 m) ,j,k)^A: 

1 A^-<EiUS 2 ,0,±,0,0,/C 1 ) 

2 Stack <- 

3 i <— getState(zi , z 2 , 0) 

4 while Stack / do 

5 q^pop(Stack) : 1%] = (ft, 92, ft) 

6 for Vei G £(ft) do 

7 forVe 2 G £ , (g 2 ) do 

8 if ^(ei)=4(e 2 ) A (ft = V ^-(ei)^e) 

9 then g' <-GETSTATE(n(ei),n(e 2 ),0) 

10 £ <- £ U { (ft £( ei ):£(e 2 ), io(ei) ® w(e 2 ), <?') } 

11 for Vei G £(ft) do 

12 if £j(ei)=e A ft G {0,1} 

13 then g' ^ GETSTATE(n(ei),ft,l) 

14 E^EU{ (q, £{ ei ):e( m \ w{e x ), q') } 

15 for Ve 2 G E{q 2 ) do 

16 if 4(e 2 ) = e A ft G {0,2} 

17 then </ <- GETSTATE(ft , n(e 2 ), 2) 

18 £-£U{(g, e (n) :^(e 2 ), w(e 2 ), c/> } 

19 return ^4 

GETSTATE(ft,g 2 ,ft)^g: 

20 if VGQ:%'] = (ft,g 2) ft) 

21 then g <— g' 

22 else Q <— Q U {</} [create new state] 

23 ^(g) <- g(gi) (8) £>(g 2 ) 

24 %]^(ft,g 2 ,ft) 

25 push(Stack, q) 

26 return g 
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First, we create the initial state i of A from the initial states of A\, A2, and A £ , and push i onto the 
stack (Lines 3, 20-26). While the stack is not empty, we take states q from it and access the states q\, q2, 
and q £ that are assigned to q through i?[g] (Lines 4, 5). 

We intersect each outgoing transition e\ of q\ with each outgoing transition e2 of (72 (Lines 6, 7). 
This succeeds only if the j-th label component of e\ equals the A;-th label component of e^, where j and 
k are the two intersected tapes of A\ and A2 respectively, and if the corresponding transition in A £ has 
target (Line 8). Only if it succeeds, we create a transition in A (Line 10) whose label results from 
pairing l{e{) with Ifa) and whose target q' corresponds with the triple of targets (ra(ei), n(e2), 0). If q' 
does not exist yet, it is created and pushed onto the stack (Lines 20-26). 

Subsequently, we handle all e-transitions in A\ (Lines 11-14) and in A2 (Lines 15-18). If we en- 
counter an e in A\ and are in state or 1 of A £ , we have to move forward in A\, stay in the same state in 
A2, and go to state 1 in A £ . Therefore we create a transition in A whose target corresponds to the triple 
(n(ei), q2, 1) (Lines 11-14). The algorithm works similarly if and e is encountered in A2 (Lines 15-18). 

To adapt this algorithm to non-weighted MTAs, one has to remove the weights from the Lines 10, 14, 
and 18, and replace Line 23 with: Final (q) <— Final(q\) A Final{q2). 

6.4 Multi-Tape Intersection 

We propose two alternative algorithms for the multi-tape intersection of two WMTAs, A™ and A 2 . 

6.4.1 Conditions 

Both algorithms work under the conditions of their underlying basic operations: The semirings of the 
two WMTAs must be equal {K\ =£2) and commutative. The second (more efficient algorithm) requires 
all transitions to be labeled with n-tuples of strings not exceeding length 1 on (at least) one pair of 
intersected tapes ji of A^ and fej of A 2 which means no loss of generality: 3i G [1, r] : ( Vei G E\ : 
|^( ei )|<l)A(V e2 G£2: 14,(62)1 <1) 

6.4.2 Algorithms 

Our first algorithm, that we will refer to as INTERSECT 1L4j , A 2 ,j\... j r , k\ . . . k r ), follows the ex- 
act procedure of multi-tape intersection (Eq. 37), using the algorithms for cross product, auto-intersection, 
and complementary projection. 

Intersect l(A^ n) , A 2 m ' ,ji... j r , k\... k r ) — » (A , boolean) : 

1 successful <— true 

2 A^ CROSSPA(^S n) , A { 2 m) ) 

3 forVi G [l,r] do 

4 (A , success) <— AUTOlNTERSECT(A, ji,n + k{) 

5 successful <— successful A success 

6 A <— Vn+kt,... ,n+Av(^) 

7 return {A , successful) 



The second (more efficient) algorithm, that we will call lNTERSECT2(^4p , A^" 1 ' ,ji-..j r ,ki... k r ), 
uses first the above single-tape intersection algorithm to perform cross product and one auto-intersection 



Kempe, Guingne, Nicart. Algorithms for n-Tape Automata. XRCE Report 2004/031 24 



in one single step (for intersecting tape j\ with k\), and then the auto-intersection algorithm (for inter- 
secting all remaining tapes ji with k{, for i > 1). 

INTERSECT2(A^ ) , A 2 m) ,j 1 ...j r ,k i ... k r ) -> (A , boolean) : 

1 successful <— true 

2 A^lNTERSECTCROSSEPS(^ n) ,^ m) ,ji,A;i) 

3 for Vi e [2, r] do 

4 (A , success) <— AUTOlNTERSECT(A, ji,n + ki) 

5 successful <— successful A success 

6 A<— 'Pn+fci, ... ,n+fcr(^) 

7 return (^ , successful) 



This second algorithm has been used to compile successfully the example of transducer intersection 
in Section 5. 

7 Applications 

Many applications of WMTAs and WMTA operations are possible, such as the morphological analysis 
of Semitic languages or the extraction of words from a bi-lingual dictionary that have equal meaning and 
similar form in the two languages (cognates). 

We include only one example in this report, namely the preservation of intermediate results in trans- 
duction cascades, which actually stands for a large class of applications. 

7.1 Preserving Intermediate Transduction Results 

Transduction cascades have been extensively used in language and speech processing (Ai't-Mokhtar and 
Chanod, 1997; Pereira and Riley, 1997; Kempe, 2000; Kumar and Byrne, 2003; Kempe et al., 2003, 
among many others). 

(2) (2) 

In a (classical) weighted transduction cascade, T x v ' . . . T^ , a set of weighted input strings, encoded 
as a weighted acceptor, L , is composed with the first transducer, T x , on its input tape (Figure 5). The 
output projection of this composition is the first intermediate result, L± , of the cascade. It is further 
composed with the second transducer, T 2 , which leads to the second intermediate result, L 2 , etc. The 
output projection of the last transducer is the final result, L r : 

L« = V 2 (L^ l oTP) for i€[l,r] (66) 

At any point in the cascade, previous results cannot be accessed. This holds also if the cascade is 
composed into a single transducer, T^ 2 \ None of the "incorporated" sub-relations in T^ can refer to a 
sub-relation other than its immediate predecessor: 



T (2) = t[ 2) o...oTP (67) 

e, A™ 1 . . . Ar , that uses WMTAs and multi-tape intersection, 
intermediate results can be preserved and used by all subsequent transductions. Suppose, we want to use 



In a weighted transduction cascade, A™ 1 . . . Ar , that uses WMTAs and multi-tape intersection, 
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(l) 



rJZ) 
1 1 


C 


T (2) 
1 2 


L (1) 


T (2) 
1 r 


tape 1 


tape 1 


tape 1 


tape 2 


tape 2 


tape 2 



(1) 



Figure 5: Weighted transduction cascade (classical) 



the two previous results at each point in the cascade (except in the first transduction) which requires all 

(2) 

intermediate results, L\ , to have two tapes (Figure 6) : The projection of the output-tape of the last 
WMTA is the final result, L^ : 





L? - 


- ^0 


n4 2) 
1,1 








Lf - 


= V 2 ,3 


: *& n Af) ) 


for i e [2, r-1] 




4" ■ 


= Vs( 


Lfl, n 4 3) ) 

2. 2 




L? 


A? 


L? 




A'? 




a (3) 
T (2) A r 






tape 1 


^ 


i>r- tape 1 


U^ 


tape 1 


tape 2 




tape 2 


tape 3 


U tape 3 



L 



(i) 



(68) 
(69) 

(70) 



Figure 6: Weighted transduction cascade using multi-tape intersection (Example 1) 



This augmented descriptive power is also available if the whole cascade is intersected into a single 
WMTA, A^ 2 \ although A^ has only two tapes in our example. This can be achieved by intersecting 
iteratively the first i WMTAs until i reaches r : 



,(3) 



A(->) -T>, , I AS" 1 ) n A\ 

A l...i — ' l,n-l,n{ A l...i-l 'J , A i 



(m) 



.(3) 



n-l, 1 
n, 2 



for i e [2, r] , me {2, 3} 



(71) 



i (3) 



Each Af'i contains all WMTAs from A™ to Af '. The final result A® is built from Af_ 

A^=V hn (A 1 ... r ) 



(72) 



Each (except the first) of the "incorporated" multi-tape sub-relations in A^> will still refer to its two 
predecessors. 
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In our second example of a WMTA cascade, A\ x ' . . . A? , each WMTA uses the output of its 
immediate predecessor, as in a classical cascade (Figure 7). In addition, the last WMTA uses the output 
of the first one: 



Lf = 4"n4 !) 





L? } = 


-PiAL^nA®) 




4 1} = 


p 3 (4-i n 4 3) 

2. 2 


L (1) 


A (2) 
Ai (2) 


A (2) 

A 2 T (2) 



for i G [2, r-1] 



L (2) 

L r-1 



A? 





^D- 

^D^ 


tape 1 


\\^~~~~ H 1 * 




tape 1 -^yA^^- tape 1 ^^^ • • 


tape 2 


tape 2 ^"U ta p e 2 ^""LJ 


tape 3 



(73) 
(74) 
(75) 



L (1) 



Figure 7: Weighted transduction cascade using WMTAs (Example 2) 

As in the previous example, the cascade can be intersected into a single WMTA, A^ 2 \ that exceeds 
the power of a classical transducer cascade, although it has only two tapes: 



A 
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1...J 



A 



(3) 

1...7- 
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